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nPTTMT7.T NG BANDWIDTH CONSUMP TIO N FOR ntKHJMK XTT DIS TRIBUTION 
G VHR A MULTI CAST, ENABLED WIDE AREA NETWORK 

The present U.S. palcnl application is related to U-S. patent applicaLum number 
09/216,018 (P002), entitled "A Method and Apparatus far Supporting a Multicast 
Response to a Unicast Request for Documents," which is assigned to the corporate 
assign cc of the present invention. 

Fi F r T n OF THE TNVRN'i 1QK 

The present invention relates t[> computer networks used for transmitting and 
distributing documents in the form of a eollceimn ofdigttai data. More specifically, ihe 
present invention relates to a method and apparatus tor dynamically and intelligently 
caching documents at a local silc utilizing an agent to iliter incoming multicast streams of 
documents. 

BACKGROUND OKTHR INVENTION 

The Internet has become an expansive backbone of communication with a vast 
repository of information available in various formats. The reason for its popularity is due 
in iaryu pan to Ihc development of tbs hypertext mark-tip language (HTML), its related 
formats such as extensible mark-up language (XML), dynamic HTML (DI ITMT and the 
associated Transmission Control Protocol/Internet Protocol (TCP/IPj based 
communications protocols Hyper Text Transfer Protocol, HJc Transfer Protocol (FTP), 
and User Datagram Protocol (UUP). HTML has transformed the Internet from the hlack 
and while world of text into the vibrant multi-media environment that it is today. HTML 
defines the syntax and placement of special, embedded directions that instruct a web 
browser on howlo display the contents of u document which is made up of one more 
HTML text files which in (urn reference associated media files, like Joint Photographic 
Hxpens Group (JPEG) graphics, Graphics Interchange FormaL (GIF) animation files, or 
other embedded files of any number of formats including text, images, and other support 
media. HTML instructs a web browser client application on how to make a document 
interactive through special hypertext links or though embedded programs like Java 
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applets, which connect a document with other documents, n$ well as with other Internet 
resources. In addition, embedded programs can and often do contain their own interactive 
logic in the form of executable code and the associated resources. HTML and 
similar/related technologies are responsible for unifying pictures, sounds, and text in a 
document allowing the presentation of rich* multi-media filled weh pages on a wide 
variety of computer based display devices and appliances that have propelled the Internet 
as a new medium for worldwide information exchange and commerce. 

The full potential of the Internet as a medium for communication, education, 
entertainment and commerce remains unfulfilled due lo problems with its performance and 
reliability. The Internet's performance limitations stem from its basic architecture, which 
is not optimised for distribution of data-iuteiisive multimedia content. Internet 
performance is currently limited by 11>e weakest link in the cliain between the client and 
the server. BoltlcncelvS may be caused by the "last-mile" connection to the user, the 
infrastructure of Hie Internet Service Provider (ISP), the gateway to the Internet backbone, 
or the content provider's Web server. For example, the Internet frequently becomes 
overloaded when transmit in y the same data streams from popular Web site servers to 
millions of individual users, 

II is generally true thai wide area network (WAN) connections cost more and are 
slower then local area network (J.. AN) connections. What is needed is an e Elective method 
and apparatus lor minimising or eliminating redundant document transmissions across 
wide urea connections. Document transmission may include the transmission of data files 
or a collection of data Jiles- It may inciude the transmission of text, audio, media, 
embedded program*, executable code, or other data that is published a I a host server. In a 
wide area network of geographically dispersed homogeneous information consumers, the 
need fur, such an invention is most apparent. This is even more so when the users are 
consuming liigh-bandwidlh data like video or audio files. 

SUMMARY 

A method and apparatus for dynamically filtering documents transmitted on one or 
more multicast channels according to a first embodiment of the present invention is 
disclosed. A document is received off of a multicast channel It is determined whether the 
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document includes relevant information. The document is processed if ihc document 
includes relevant information. 

A filtering agent according to a second embodiment of the present invention is 
disclosed. The filtering agent includes a session identification reading unit thai retrieves a 
session identification from a document multicasted over a multicast channel which 
identifies the original requester of the document. The filtering agent includes an 
information classification unit that retrieves published mela data and generates new meta 
data for each document transmitted which, in pail* it buses its relevance decisions. The 
filtering agent includes a source unit that retrieves source information from the document. 
An evaluation unii is coupled to Che session identification reading iinit T information 
classification unit- and the source unit. The evaluation unit determines whether the 
document includes relevant information based on the session Identification, data 
information the source information, and die channel of distribution. Based on the results 
of the evaluation unit, the filtering agent either stores the document locally for Jailer use. 
forwards the document to the original requestor if that user resides within the receiving, 
location's local area network, or discards the document as nut relevant to lire local users. 

BR1FF DESCRIPTION Of 'I'llK D RAW INGS 

The present invention is illustrated by way of example and not by way of limitation 
in the figures of the accompanying drawings, in which like references indicate similar 
elements and in which: 

Figure 1 is a block diagram of a configuration of a network* according to an 
embodiment of the present invention; 

Figure 2 is a block diagram of a local computing resource according to an 
embodiment of the present invention; 

Figure 3 is a block diagram of a network operations center according to an 
embodiment of the present invention; 

Figure 4 is a block diagram of si computer system according to ;m embodiment of 
the present; 

Figure 5 h a block diagram of a document manager residing in a master proxy 
server according to an embodiment of the present: 

3 
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Figure 6 is a block diagram of a filtering agcnl residing in a local computing 
resource according to tin embodiment of the present invention; 

Figure 7 is a flow cluirt illustrating a method for managing documents in a local 
com puling resource according to an embodiment of [he present invention; 

Figure 8 is a flow chart illustrating a method for filtering documents in a local 
computing resource according to an embodiment of the present invention; and 

Figure 9 is a flow chart illustrating a method for managing documents in a network 
operations center according to an embodiment of the present invention. 

DKfAII.HU INSCRIPTION 

A method and apparatus for optimizing bandwidth utilization for document 
distribution over a wide area network is disclosed, ht the fallowing detailed description, 
numerous specific details are set forth in onjcT to provide a thorough understanding of the 
present invention. It would be understood by those skilled in the art that the present 
invention may be practiced without these specific details. In other instances, wcJJ-known 
methods, procedures, components, and circuits have not been described in detail so as not 
lo obscure the present invention. 

Figure 1 is a block diagram of a configuration of a network 100 according to an 
embodiment of the present invention, Block 140 represents a transmission medium. The 
transmission medium HO transmits documents between computer system* coupled to the 
transmission medium HO. The transmission medium 140 may be implemented by a 
twisted pair telephone line, cable, fiber optics line, satellite transmission medium, or other 
transmission medium or combination of media. According to an embodiment of the 
present invention, the transmission medium 140 is the Internet. 

Block 150 represents a server computer system connected to the Internet 140. S he 
server computer system 150 may be, for example, be one or several Hyper Text Transfer 
Protocol (HTTP) server* thai store HTML and HTML associated files. Alternatively, the 
server computer system 1 50 may he another type of server that stores other types of 
network resources and operates as a content host tliat transmit* documents via other 
protocols, 

-1 
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Blocks 1 1 0 and 1 1 1 represent local computing resources. Each of the local 
computing resources; J 10 and 1 1 1 may include it single computer .system, or a plurality of 
computer systems connected together in a local area, network or wide area tietwork. The 
local area network or wide area network may he managed by a local server. 

A network operations center 120 iR coupled to the local computing resource 1 10 
via connection 120 and the local computing resource 111 via connection 121. 'Hie 
connections 120 and 121 may be implemented individually as a twisted pair telephone 
line, cable, fiber optics line, satellite transmission, or other connection media. Kach 
connection 120 and 121 may include one of or a combination of'lhe described connection 
media so long as the connection supports the routing of multicast packets on assigned 
multicast channels or addresses. The network operations center 150 is also coupled to the 
Internet 140. The network operations center 130 operates to provide the local computing 
resources 1 10 and 1 1 1 with access to the Internet 140 and access to network resources. 
The network operations center 1 30 may operate as an T$P to the local computing resoiucca 
1 10 and ill. According to k» embodiment of die present in vent ion, die network 
operations center 130 fetches mid caches and latter -on delivers documents from the server 
1 50 to the Incal computing resources HO and ! ) ) when requested. The network 
operations center 130 multicast* documents requested by one of the local computing 
resource 1 10 or 1 11 to the non-requesting local computing resource 110 or 1 1 1. By 
sharing documents that contains relevant information to die no n -requesting local 
computing resource, the network operations center T30 provides the local computing 
resource 3 10 and 1 1 1 with documents that users on hath the local computing resources 
1 1 0 and 1 1 3 arc likely or guaranteed to consume at some point of time. Sharing 
documents allows accessing documents in the network 100 to be more efficient It also 
delivers a more responsive user experience and facilitates the delivery of high bandwidth 
anriVortiine dependent documents. The local computing resources 110 and III filter the 
incoming documents from the network operations center 1 30. Before storing and/or 
forwardmy documents received from the network operations eenler 130. the local 
computing resources 1 JO and 1 1 1 make sure that the documents ^re relevant for the 
purposes of the local computing resources 1 1 0 and 1 1 1 and has a probability of being used 
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or has been transmitted in response to a local users request. This allows the storage 
device in the local computing resources 1 10 and 1 1 1 to be used efficiently. 

It should be appreciated that any number of local computing resources may be 
connected to the network operations center HO and that any number of network operations 
centers may be connected to the Internet 1 M). ft should also be appreciated that any 
number of server computer systems may be connected to the Internet 1 40, 

Figure 2 is a Muck diagram of the lueal computing resource 1 1 0 according to an 
embodiment of the present invention. The local computing resource 1 10 shown in Figure 2 
may also be implemented as the local computing Tefiource 111 (shown in Figure I). The 
local computing resource 1 10 includes a plurality of client terminals 210-213. The client 
terminals 210-213 may be general purpose computer systems, set-top boxes, or other types 
of client terminal devices. Each client terminal includes at least an input device and a 
display device. Client terminals 210-21 3 are connected together in a network via network 
connection 220 which should, but docs no I necessarily need to support multicast 
transmissions. 

The local computing resource J 10 includes a loca! server 230 which in another 
embodiment could be a logical process on a single users machine. The local server 230 
operates as a local caching proxy saver that runs communications proxy and 
storage/cache services for the client terminals 210-21 3, The proxy servicer; take requests 
for Internet services, such ns I 1TTP % FTP, and Telnet, from the client terminals 21 0-21 3 
and forwards them lo the network operations center 130 (shown hi Figure 1) as 
appropriate according to the protocol of the local computing resource 110, "The proxies 
provide replacement, re-directed or intermediary conn eel ions mid act as Gateways to the 
services. According lo an embodiment of the present invention, the local server 230 
includes a storage device (not shown) that stores documents pre-fclched or previously 
fetched by the network operations center 1 3D. When a HTTP request is made by a client 
terminal, the proxy service run by the local server 230 checks the sturagc device to 
determine whether (he documents requested by the client terminal arc cached in the 
storage device, If The documents are cached in the storage device, the documents from the 
storage device are sent to the client terminal. If the documents arc not cached in the 
storage device, the proxy service forwards the request to the network opera lions center 

6 
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130. In an alternate embodiment, Ihe network connection 120 itself is interconnected Lo the 
Interna 14f>, with a reply-to address af a network ccmimunlcadons unit 250. 

The network communications unit 250 is coupjed to the local server 230. The 
network communications unit 250 operates tu connect the local server 230 io the network 
operations center 1 30 via connection 120. The network communications unit 250 may 
include a telephone modem, cable modem, satellite receivcx/lrunsniitten router, or other 
communication devices. The network communications unit 250 may include one or a 
plurality of the described communication devices. The network cnninnmicaiions device 
supports both uuicust and multicast connections. 

The local computing resource 1 10 may optionally include an inpul/output device 
240. The input/output device 240 may be coupled lo die local server 230 as shown in 
Figure 2. Alternatively, the input/output device 240 may be coupled In the network 
connection 220.. to the network communication unit 250> or lo a client terminal 210-213. 
The inpui /output device 240 may include one or more primers* display video monitors, 
video cameras, or other tnput/oulpul devices. 

Figure 3 is a b!«ck diagram of a network operations center 130 according to mi 
embodiment of the present invention. The network operations center 1 30 includes a 
router/firewall 310. MIjc router/fire wall 3 JO connects the network operations ctmicr 130 to 
the Interna 140. The router/firewall j 10 operates To prevent unauthorized nccess to the 
network operations center 130. The router/firewall 310 analyzes all messages entering Lhe 
ncLwnrk operations center 130 to determine whether they meet specified security criteria. 

The nc (work operations center 130 includes a network communications unit 315. 
The network communications unit 315 operaica u> connect the network operations center 
130 to the local computing resources 1 10 and I f f via connections 120 and 121. The 
network communications tinil 3 1 5 may include a telephone modem, cable modern- satellite 
receiver/transmitter, muter, or other cnmmimi cation devices. The network 
communications unit 3 15 may include one of or a plurality of the described 
communication devices. The network coximiunieaiioos device supports bnlh unieasl and 
multicast com ice Lions. 

The network operations center 130 includes a plurality of servers that perform 
specific functions Rt the network operations center 130. According to an embodiment of 

7 
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Ihc present invention where tlie network operations center 130 includes a plurality of 
computer systems connected in fi network, each server may be implemented by a single or 
a plurality of computer syslcms. Alternatively, in an embodiment, of the prcscnL invention 
where the network operations center 130 is a single computer system, the servers may be 
implemented in software as software modules or 3n hardware as individual hardware 
coinponenls- 

ln one embodiment the network operations center 130 includes a database server 
320 that manages data hi a network operations center data storage 32.5. Data in the 
network operations center data storage 325 may include data corresponding to users with 
access 1o client terminals, statistical data on and- profiles of the users, privileges granted to 
the users, Advertisement data, lo^s corresponding to user activities, hilling status dala : 
preference data, and other types of data. In one embodiment, this database would likely 
also ho id the list of trusted sources of data/documents. 

In one embodiment, the network operations center 130 includes a statistics server 
330. The statistics server 330 interfaces with data stored in the network operations data 
storage 325 to generate real-time usage statistics. For example, the statistics server 330 
may access an activity log and a statistics database in the network operations center data 
.storage 325 and generate real -time usage statistics. 

In one embodiment- the network operations center includes an applications server 
340. The applications server 340 supports web-bascd services such as an HTML enabled 
e-mail systems, chut systems, discussion boards, ami communication gateways, The 
application server 340 may also support the server side of client/server applications. The 
applications server 340 may reside inside the network operations center 130 as shown in 
Pigure 3, or alternatively reside outside of the network operations center ] 30. 

In one embodiment, the network operations center 130 includes an advertisement 
server 350. The advertisement server 350 schedules and manages advertisement 
placements thai, are displayed on client terminals 2J 0-2 1 3 (shown in Figure 2} at local 
computing resources 11 (J and 120 (shown in figure I). It shook! be appreciated that in an 
alternate embodiment of the present invention, the functionality of the advertisement 
server 350 may he implemented in a middleware server ,170, Additional uses and details 
of the advertisement server 350 can be found in U.S. Patent application number 

H 
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09/227,476 (POOS), entitled "Micro Targeted Advertising," winch is assigned to the 
corporate soignee of the present invention. 

[n one emlxnliment, the network operations center 130 includes a weh server 360. 
The web server 360 publishes HTML documents and related fifes via TCP/IP, HTTP, 
FTP, UDP, and other protocols. 

In one embodiment, the network operations center 130 includes a middleware 
server 370. The middleware server 370 interfaces with the database server 320, stalisties 
server 320, application* server 340, usually via the web server 360. in response to requests 
from Lhe client terminals 210-213 to dynamically publish web pages, send and receive 
tlncumenty to and from client terminals, as well as to establish and track user sessions on 
client terminals. The middleware server 370 interfaces with the database server 320 to 
store, log> retrieve, and process session related data. The middleware server 370 nteo acts 
as a communication Lnj-m proxy for remote applications and serves to associate sessions 
with users, and terminal entities so as to associate logged us«ige with users. Additional 
uses and details on the it! en location of sessions can be found in U.S. Patent application 
number 09/21 3,614 (POOS), entitled "ENTITY MODEL RELATIONSHIP ENABLING 
PRIVILAGE TRACKING ACROSS MULTIPLE TERMINALS," which h assigned u> 
the corporate assignee of the present invention. 

In one embodiment, the network operations center 130 includes a master proxy 
server 3S0. The master proxy server 380 handles and redirects requests to appropriate 
locations and servers. The master proxy server 380 interfaces with the master weh cache 
385, According to an embodiment, of the present invention, the master proxy server 3&0 
receives a request for a document from a local computing resource. The master proxy 
server 380 determines whether the document is stored in Lhe master web cache 335, If the 
document is stored in the master web cache 385, the document from the master web cache 
is delivered tn the local computing resource by the master pro*y server 380. If the 
document is not stored in the master web cache 3SS, or is out of dale, the master proxy 
server 380 retrieves the document nn hclialf of the local compelling resource. The muster 
proxy server 380 may cache a copy of the document in the master web cache 385, lhe 
master proxy server 3K0 determines where and how Lhe document is transmitted. It should 
be appreciated that tbo master web cache 385 may be implemented by any known caching 
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device. The document may l>c either unicasted or mtilticasted to ihu requesting Jocnl 
computing resource and other local computing resources as described in more detail 
lx:low. 

Jn order to reply to a unicast request via a multicast channel and recover potential 
packet loss, a sender (the master proxy server 3XC> ) buffers sent packets mti re-sends on n 
negative acknowledgment. By removing unnccded rep] ies for receipt acknowledgment 
while assuring complete intact delivery, IP ramp- up lag is minimized. 

The network operations center 130 includes a multicast server 390. The multicast 
server 390 sends documents or any fite or group of files to specified directories on pre- 
defined local servers via Internet Protocol (IP) Multicast protocols or otlier oue-to-many 
transmission protocols. The multicast server 390 manages multicast channels on which 
the documents arc transmitted. According to an embodiment of the invention, the local 
server 230 (shown in Figure 2) where functions and resources of die network operations 
center 130 may be replicated, the multicast server 390 packages and schedules reliable 
multicast file transfers of documents, such at* documents stored in the web cache 385 and 
in the network operations center data storage 325 onto the local server 230, It should be 
appreciated that the multicast server 390 may also multicast replicated web sires such as a 
directory tree of HTML and associated HTML files, such as J P KG, GIF, JAVA, tmd otlier 
files, from the web server 360, applications such us installable software packages from the 
applications server 34-0, and data and resources from other servers on the network 
operations center 130 or servers 150 connected to the Internet HO onto the local server 
230. 

It should be appreciated dial not all of the servers described *ire necessary for 
implementing the network operations center 130. According to an embodiment of the 
present invention, die network operations center 1 30 may be implemented using only the 
master proxy server 38(K the middleware server 370, and the database Server 320 with its 
associated data storage 325. 

Figure 4 is a block diagram of a computer system 400 aeeording to an embodiment 
of the present invention. The computer system 400 mey be used In implement the network 
operations center 1 10 (.shown in lugurc 1) in an embodiment nf the present invention 
where the network operations center 130 is implemented by a single computer system, 

in 
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The computer system 400 may be used to implement or one or more of the servers in the 
network operations center 130 in an embodiment of the present invention where the 
network operations center 130 is implemented by a network of computer systems. It 
should be; appreciated thai the- computer system 400 may also he used to implement one of 
the client terminals 210-213 (shown in Figure 2) and the local .server 230 (shown in Figure 
2). 

The computer system 400 includes a bus 430. The bus 410 may be a single bits or 
a combination of multiple buses. As an example, the bus 320 may include a Peripheral 
Component Interconnect (PCI) bus, an Industry Standard Architecture (ISA) bus, or a 
combination of other buses. A processor 420 is coupled to the bus 410. The processor 
420 processes data signals. 

A memoiy 430 is coupled to the bus 410. The memory 430 may store instructions 
find code represented by data signals that may be executed by the processor 420. A 
network controller 440 is coupled to the bus 41 0. The network control Ter 440 links the 
computer system 400 to a network of computers (hoi shown in Figure 4) via a connection 
(not shown) or a network communications unit (not shown in Figure 4) and supports 
communication among the machines. A display device controller 45(1 may he coupled to 
the bus 410. Hie display device controller 450 allows coupling of a display device to the 
computer system 400 and acts as an interface between the display device and the computer 
system 400. A data storage device 460 is coupled to the bus 4 10. The da I a storage device 
460 may be a hard disk drive, a floppy disk drive, a CD-ROM device, a flash memory 
device or other mass storage device or any combination thereof. An input device interface 
470 is coupled to the bus 410. The input device interface 470 allows coupling of a 
keyboard, pointing device, or other input device to the computer system 400 and transmits 
data signals from an input device to die computer system 400. 

II should be appreciated that not a! J the components described in the computer 
system 400 may be necessary for implementing one of the network operations center 130. 
a .server on the network operations center 130, a local server 230, or a client terminal at the 
local computing resoitrce 1 10. The present invention is related to the use of the computer 
system 400 to fitter and distribute documents and data. According ui one embodiment, 
filtering and distributing documents Ls performed by the computer system 400 in response 
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lo the processor 420 executing a sequence of hisirue lions in memory 430. Such 
instructions may be read into memory 430 from anolher computer-readable medium, such 
as dam storage device 460, nr from another source via the network controller 440. 
Execution of the sequence of instructions causes the processor 400 to filter a document, as 
will be described hereafter. In an alternate embodiment, hardwired circuitry' may bo used 
in place of or in combination with software iu5truclioa<5 to implement the present 
invention. Thus, the present invention is not limited to any specific combination of 
hardware circuitry and software. 

Figure 5 is a block diagram of a document manager unit 500 residing in die master 
proxy server 3K0 (shown in Figure 3 J according to an embodiment of Lhe present 
invention, hi a preferred embodiment of the present invention, the documenl manager unit 
500 is implemented by software and resides in main memory 430 (shown in Figure 4) as a 
sequence of instructions in a computer system 400 thai, may be implemented as an 
extension of the masrer proxy server 3*0 or a network operations center 130 having a 
master pro,\y server 380. It should he appreciated thai the document manager unit 500 
may also be implemented by hardware as components coupled to the bus 4 1 0 (shown in 
Figure 4). a combination of both hardware and software, or a plurality of computers 
working together as performance and traffic requirements dictate. 

The document manager 500 includes a session profile unit 5 tO. The session 
profile unit 510 interfaces with (he middleware server 170 (shown in Figure 3) to obtain 
session information relating to The session run al a local computing resource that is 
requeuing a document and currently active ubcr sessions as well as user and session 
information associated with the specific request for a. dot urn enl. In an alternate 
embodiment, the session profile unit 510 might query (Pie database server 320 (shown in 
Figure 3) directly to obtain this information. The real purpose of the middleware server 
370 is lo simplify getting information out of the database server 320, each server and 
elienl actually could talk to die database server 320 directly, but this would be a lot more 
wnrk and load on each of ihese systems. In one embodiment, the middleware server 370 
reports to the master proxy server 380 all new session ID as they arc assigned as well as 
the associated user and session privileges and any oiher relevant data. The master proxy 
server 3 SO also messages at lhe end of a session to purge the session TD from the masLer 
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proxy server's "Current session" table in memory, The session information include* 
information relating to a user thai is requesting the document and a set nf access privileges 
that belongs ro the user or session. The session profile unit 510 transmits the session 
information to the document transmission unit 550. 

In one embodiment, the document manager 500 includes a document profile unit 
520, The document profile unit 520 receives a copy of the document that constitutes the 
document requested by the local computing resource. The document profile unit 520 
obtains, generates and formats meta data relating to the content of each document that has 
been requested and allows the document transmission unit 550 to determine where and 
bow to transmit the documents. According to a Just embodiment nf (he present invention, 
ihe document profile unit 520 reads and re-formats recognized meta tags in the document 
that were placed there by the nriginal publisher of the dueurnent and provide subject 
matter, size, source, rating, keyword, or other information relating to the document anil 
forwards ihe ineta data to the document transmission unit 550. According to a second 
embodiment of the present invention, the document profile unit 520 reads the content of 
the document to determine relevant subject matter, size, source, or other information 
relating lo the document and forwards the meta data to the document transmission unU 
550. Jn another example, the document profile unit 520 may also interface with the master 
proxy server 380 (shown in Figure 3) on the network communications interface (o 
determine if the document being requested by n user has heen "popular" with a specific 
class of local computing resources and forwards this meta data to the document 
transmission unit 550. According lo this embodiment of the present invention, the 
document profile unit 520 may generate ils own meta tags in its own format that are 
transmitted with the document or in some other way append or associate its aggregated 
formatted meta data to ibe source document. Some of ihe data, tike source information, 
could be evaluated even before ihe document is received and hefp sj>ced the evaluation 
process. For example, an excluded source could redirect to a unicast reply even before ihe 
source document arrives for evaluation by the other unilx. The meta tags may be 
transmitted in the header of the document, the body of the document, or using other 
techniques. 
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III one embodiment, the document manager 500 includes a local computing 
resource (LCR) profile unit 530. The local computing resource profile unit 530 interfaces 
willi the middleware server 370 to obtain profile information relating lu local computing 
resources that are coiuiected to the network 100 (shown in Figure 1). [lie profile 
information includes information relating to (he subject matter, size, source of the 
document or other content associative parameters that a toca! computing resource would 
like to receive or explicitly rejects (e.g. documents published from a suspect source). The 
profile information may also include the identity of content class associated multicast 
cliannels on which ro transmit the document lu ihc local computing resource. The local 
computing resource profile unit 5.10 transmits the profile information to the document 
transmission uniL 550. 

In one embodiment, the document manager 500 includes a system status profile 
unit 540. The system stains profile unit 540 includes a list of local computing resources 
and the multicast channels each local computing resource are listening tii. The system 
status profile unit 540 also includes associated target content profiles for each of the active 
multicast channels. The system status profile unit 540 operates to inform rhe document 
transmission unit 560 of current transmission latencies for all acLivc multicast cliannels. 
According to one embodiment, the system status profile unit 540 monitors activity on 
multicast channels and transmits traffic information to the document transmission unit 
560. The system status profile unit 540 informs the document transmission unit 560, via 
the traffic information, if the multicast channel to which a document is to be rc-dirceied is 
backed-up beyond some maximal latency. In response, the document transmission unit 
500 may generate a imkast reply to the original requester since a multicast reply would 
require too much time due to an over subscribed multicast channel. In one embodiment a 
document may be placed in a queue for evaluation and possible multicast when traffic 
subsides. 

A document transmission unit 550 is coupled to the session profile unit 5) 0, the 
document profile unit 520, and the local computing resource profile unit 530, and the 
system status profile unit 540 in a way such that transferred information (mcta data) from 
the sources arrives with or is associated with the document for each requested document. 
Some might be appended as mcta tags in the bndy, and others as announcements prior to 
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the arrival of the document. Some of this compiled meta date should be passed on \a the 
receivers so as to keep them from having to rc-generats the meta data. This is further 
described below in relation to Figure 6. The document transmission unit 550 operates to 
determine where and how to transmit the requested document and Jaeilitates ihc 
transmission. The document transmission unit 550 receives session information from Ihc 
session profile unit 510, The session information may be used by the document 
transmission unit 550 to determine whether to transmit the document requested by a user 
at a local computing resource to the requesting local computing resource only using 
unicast techniques, to transmit the document requested to the requesting local computing 
resource and to other non-requesting local computing resources at the some lime using, 
multicast technique*, or Co transmit lite document requested to the requesting Jocal 
resource using unicasl techniques and to other non-rcqucstinB local computing resources 
usin^ multicast techniques at a later time or upon further determination, This decision 
might also lake into account network traffic and capacity as repot ted hy the system status 
unit (540), (e.g. the multicast channel is hacked-up and has a delivery- latency greater them 
a maximal value-) In such a ease the document must be sent back to the original requester 
via utiicnsl in order to avoid a request time-out on the client end, "The document can then 
be placed in a queue for evaluation and possible multicast latter on when traffic subsides. 
'Hie session information may include information relating to the user and access privileges 
that belong to the user. For example, a request for a document by a user with a high level 
of access privileges such as "instructor class access privileges" may effectuate the 
document requested to be multicasied to the requesting and non-requesting computiii£ 
resources at die same time and perhaps on n priority or reserved channel /address. A user 
with middle level access privileges such as "contributor class access privileges'* may 
effectuate the document requested to be unieasted to the requesting Jocal computing 
resource and inulticasled to non-requesting local computing resources at a later lime or 
upon further determination or sent on a lower priority channel. A user with low level 
access privileges such as "viewer class access privilege'^ may effectuate the document 
requested to be unicasicd only to the requesting local computing resource or only 
multicuslcd if the document itself triggers a multicast as described below. It should be 
appreciated that other schemes of determining where and how the document is transmitted 
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may be utilized. Such schemes may relate directly to the purpose and use of the network 
and the type of document being consumed. 

Hie document transmission unit 550 receives metadata from the document profile 
unit 520 and local profile information from the local computing resource profile unit 530. 
The mcta data and tlie profile information may be used by the document transmission unit 
550 to determine whether or not to cache a copy of a requested document onto the web 
cache 385. The meta da to and profile information may also be used by the document 
transmission unit 550 to determine whether or not to muhicasL the document to non- 
requesting local computing resources and if so to which ones by selecting one or more 
active multicast channels. In tine embodiment, the network operations center 1 30 
announces channel changes or temporary channel assignments to the appropriate local 
computing resources. Tor example, an "InsuueLor" in one location tells 10 students at ten 
different locations to look at a specific document which may contain a live multicasted 
video stream. The master proxy server 380 dynamically assigns a multicast channel to 
•>cnd the instructor's document back on, and at the same time, inform the ten other 
locations of this dynamic channel assignment (via a separate "Announcement" channel.) . 
According lo an embodiment of the present invention, die document transmission unit 550 
multicnsts a document lo a non- requesting local computing resource if the mcta data 
corresponding to the document matches the profile information for a currently active 
multicast channel. It shnuld be appreciated that udicr schemes for determining, whether or 
not to multicast the document lo non-requesting local computing resources may be 
utilized According tr> an allerxialc implementation, the network operations center 130 
rejMM'ts to all niaic-Jiiug local computing resources what channel that matching dneurnent is 
going to be scnl on, and when un a .separate ''Announcement 1 - multicast channel Ural all 
local computing resources listen lo. According to one aspect of" this embodiment, the 
network operations center 130 waits for *Vm listening" acknowledgements from oil target 
Jocal computing resources beibre sending the document. 

Jt should he appreciated that how, when and in what order mcta data, is sent to the 
local computing resource 1 10 may be important. Some meta data may he appended as re- 
formatted mcta lags, others may be sent prior to the receipt of the document su as lo allow 
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the computing resource 1 10 to make a fast decision that will allow the Jocal computing 
resource to run more efficiently. 

The document transmission unit 550 also facilitates the transmission uf documents. 
For example, if a determination h made dial the document being requested is to be 
unicasted back to the requesting local computing resource, even if it is also In be 
mullicasted to a set of non-requesting local computing resources, the documenl 
transmission unit 550 may redirect the server sourctng (serving/host In jjQ the document to 
transmit the document directly lo the requesting local computing resource or though a 
separate proxy server or connecting On the oilier hand, if it is determined that the 
network operations center 130 (shown in Figure 1) should multicast a copy of the 
requested document or examine the requested document the document transmission unit 
550 may direct the server soureing the document to transmit the document to hnth the 
requesting local computing resource and Hie ncLwurk operations center 130 or alternatively 
have the document transmitted only io the network operations center 1 3Q and have the 
network operations center 130 unnsmil the document to the requesting local computing 
resource. If a determination is made thai the document being requested is to he 
multicasted, the document transmission unit 550 transmits the document and an identity of 
an appropriate multicast channel in wliieh the document is lo he sent on to the multicast 
sei-ver 390 (shown in Figure 3). Tlie session identification as well as some or nil of the 
aggregated mcta data, and perhaps other session data like the class of the original 
requester, (e.g. "an instrticror you trust asked for this document" 1 ) is included with the 
document as it is transmitted. En an alternate emhodiment, the master proxy server 380 
may operate as the muhicasi server 390 directly by extending its features to handle re- 
siding packets on a negative acknowledgement. According to an embodiment of the 
present invention, the session identification is appended in a header of the document as a 
cookie. According to an alternate emhodiment of the present invention, the session 
identification is appended as a CGI variable. It should be appreciated that the session 
identification may he transmitted with (he document using other techniques. 

Tt should be appreciated that the document manager 500 may transmit a document 
to local computing resources with or without transmitting meta tags corresponding to ihe 
document, ll. should also be appreciated that the document manager 500 may multicast the 
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document tu local computing resources without taking into account ihc session 
information associated with the original request, metei data in file document, or profiles of 
local computing resource where the document is to be multicasted. As described, the 
document profile imit 520 may generate and append mela data. It should also be 
appreciated that the session profile unit 510, the local computing resource profile unit 530, 
and the system status profile unit 540 may also generate and append nicta data. 

The session profile unit 510. document profile unit 520, local computing resource 
profile unit 530, and the document transmission unit 550 may he implemented using any 
known circuitry or technique. In an embodiment of the present invention where the 
document manager unit 500 is* implemented in hardware, the session profile unit 51 0, 
document profile unit 520. local computing resource profile unit 530, and the document 
transmission unit 550 all reside in a single semiconductor suhsirate- In a preferred 
embodiment of the present invention, iho document manager 5O0 is implemented by 
software proxy server plug-ins or extensions, and/or packet filter processes. 

Figure 6 is u Mock diagram of a filtering agent 600 redding in the local server 230 
(shown in Figure 2) according to an embodiment of the present invention. The filtering 
agent 000 operates to filter document* received on one or more multicast channels by the 
network communications unit 250 (shown in Figure 2) of the local computing resource 
1 1 0 (shown in Figure 2), In a preferred embodiment of the present invention, the filtering 
agent 600 i> implemented by software and resides in main memory 430 (shown in Figure 
4) as a sequeuce of instructions in a computer system 400 dml may he implemented as the 
local server 230 (shown in Figure 2). It should be appreciated that the filtering agent 600 
may also be implemented by IianJwarc as components coupled u> ihc bus4J0 {shown in 
Figure 4), a combination of both hardware and software, or a plurality of computers 
working together. 

The filtering agent 600 includes a network communications interface 610. The 
network communications i nter face 610 interfaces with the network communications unit 
250 and receives multicast documents transferred over multicast channels from the 
network operations center 1 30. The network communications interface 610, could also 
receive broadcast notification and channel activation, de-activation, and content 
assignment changes. In addition it should report new local computing resource channel 
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subscriptions {c>z. multicast channel lhaL art; monitored at the local site) and profile 
changes to the network operations center lit), 

A session identification reading unit 620 is coupled to the network 
communications interface 610. The session identification reading unit 020 receives the 
document from ihc network communications interface 61 0. Tins process may he "made 
aware :: to listen for a session 10 or specific document via an prenannouncement from the 
network operations center 110, In one embodiment, there is a separate announcement and 
schedule multicast channel. Tn another embodiment announcements are send via unicast. 
The session identification reading unit 620 analyzes the document for a session 
identification associated with the document and forwards the session identification to on 
evaluation unit 650, The session identification may be used by the filtering agent 600 to 
determine whether the document is the reply to a local user's request or includes relevant 
information and how U> process the document. The session identification reading unit (520 
may bs configured to recormize that the session idcni5 lication may be transmitted with the 
document as a cookie appended in the header of the document, as appended custom meta 
tags, or in a separate transmission announcement, or usiny other techniques and search for 
the session identification accordingly. 

An information classification unit 630 is coupled to the network coironunicalions 
interface 6a 0. The in formation classification unit 630 receives the document from the 
network communications interface 610. The information class! ficatmn unit 630 analyzes 
the document for meta data relating to the content of the document. The meta data may be 
used by the filtering agent 600 to determine whether the d ocun icnt includes relevant 
information and how to process the document. All meta data needed may be appended to 
the transmission or announced separately by 550 from 520. According to a first 
embodiment of the present invention, Lbc information classification unit 630 reads meta 
tags in the document that provide subject matter, size, source, rating, keyword, or other 
information relating to the document and forwards the meta data to the evaluation unit 
650. The meta tugs may he puhlished with the document, appended the network 
operations center 1.10, or other entity process. According to a second embodiment of the 
piiescnt invention, the information classification unit 630 reads the content of the 
document lu determine relevant subject matter, size* source, rating, keyword or other 
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information relating to the document and Towards the metadata to the evaluation uniL 
650. It should be appreciated that Ihe second embodiment of the present invention may be 
implemented when meta lags with metn data arc not supplied by the source of [he 
document, the network operations center 130, or other process. The second embodiment 
of the present invention may also be implemented as a rcdundnut operation or secondary 
meta data gathering process that may differ from the mcta data gathering process in the 
network operations center 130, 

A source imil 640 is coupled to the network communications interface 610. The 
source unit 640 receives the docuincnL from the network communications interface 610- 
The source unit 040 analyzes the document for source information leJaiing lo where the 
doe um en L originated. The source information may be used by the filtering agent 600 (via 
the evaluation unit 650) to determine whether the document includes relevant information 
and how to process die document. The source information may include a URL* domain or 
other source information that may be written in the body of the document. The source unit 
040 transmits the source information to Ihe evaluation unit 650. 

The evaluation unit 650 is coupled to the session identification reading unit 620. 
ihe information classification unit 630, and the source unit 640. The evaluation circuit 650 
receives the session identification associated with die document from the session 
identification reading unit 620. The evaluation circuit 650 determines whether tile session 
identification corrcspnndK to a session being run by a client terminal on the local 
computing resource. Acenrding to an embodiment of the present invention, die evaluation 
circuit 650 accesses a session log Lo dcicrmine whether the session identification 
associated with (he document matches the session identifications that are active on the 
local computing resource. Jn one embodiment, an announcement informs the ova! nation 
Link 650 to look for a reply on a specific multicast channel. In an alternate embodiment, 
the session may be associated with a "Trusted User' 1 like an "Instructor class user" uj 
effect a push into cache. If the session identification associated with the document 
corresponds To a session being inn by u client terminal on the local computing resource, 
the evaluation unit 650 [ecognizes that the document is a multicast response to a unicast 
request for the document made by the local computing resource. The evaluation circuit 
650 forwards the document to the appropriate client terminal, 
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The evaluation unit 650 receives meta data associated with the document from Ihe 
information classification unit 630. The evaluation unit 650 determine* whether the 
document includes meta data that matches a local information profile of the local 
computing resource. According to an embodiment of the present invention, the evaluation 
unil 650 accesses a iocaJ information profile to determine whether the meta data associated 
with the document matches meta data. The evaluation unit fi50 may decide whether to 
cache or discard the document based on the determination- In one embodiment, the Jocal 
information profile may include inclusionary meta data Lhat would prompt the evaluation 
unit 650 to cache the document. The inelusionmy mcta dala may include anything on the 
subject Advanced Algebra for example. The meta data in the local information profile 
may also include exclusionary mcta data that would prompt the evaluation unit 650 tn 
discard the document. The exclusionary meta data may include keywords that include 
profanity, or subject matter that may be considered offensive or inappropriate tn the users 
at the local computing resource. It should be appreciated that the local information profile 
may be configured by a system administrator, compiled dynamically based on prior 
requests far the document, ot using other techniques or combinations of techniques. 

The evaluation unit 650 receives source information associated with the document 
from the source unit 640. The evaluation unil 650 determines whether the document 
origin nl cd from a "Trusted" source disit is recognized by the local computing resource. 
According to an embodiment of die present invention* the evaluation unit 650 accesses the 
local information profile to determine whether the source information associated with the 
document matches source information in the local information profile. The evaluation unil 
650 may decide whether to cache or discard the document based on the determination. In 
one embodiment, the source information in the local information profile may include 
indusiouary source information that would prompt the evaluation unit 650 lo cache the 
document. The inclusioriary source information may include a private index of web shea, 
JJK! domain, sub domains, IP addresses or ranees of addresses, etc. (e.g. a database of 
educational web sites such as the National Geographic Society, and the NASA web site 
etc,) The meta data in the local information prufile may aSso include exclusionary source 
information that would proinpL die evaluation unil 650 to discard the document. The 
exclusionary source Information may include adult oriented web sites, or web sites 
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containing information lhat may be considered inappropriate for the users at the local 
computing resource. Jt should be appreciated thai the local information profile may be 
configured by a system administrator, compiled dynamically based on prior requests for 
the document, or using other techniques such as Al aud/or pattern recognition and 
matching. For example* tile local information profile may he configured dynamically by 
an agent on the local seiver that includes URLs frequently accessed by user* on the local 
computing resources as inclusinnary source information. The local information profile 
may be later edited by a system administrator who may delete source information that may 
be inappropriate. 

According to an embodiment of the present invention, the evaluation unit 650 
includes a redirection unit 655. It should be appreciated that the redirection unit 655 may 
reside inside the ©valuation unit 650 as shown m Figure 6 or alternatively resids external 
to the evaluation unit 650. '1 he redirection unit 655 "spoofs" ov plays ihe role of a host 
server thai sources requested documents. According to an embodiment of Lhc present 
invention, the redirection unit 655 "'feeds'* or "serves" packets of data that constitute that 
document to a proxy at the local computing resource that believes it is talking to the host 
server. This allows the proxy to operate as it would if it hud made a unieast HTTP 
connection wjth the host server. According to an alternate embodiment of the present 
invention, the redirect unit 655 merges the proxy at ihe local computing resource that it 
has the document that was requested. In response, the proxy stops attempting to connect 
to the host server. The pruxy then plays the role of the host server to the cTient terminal 
requesting the document. 

It should be appreciated that die filtering agent 600 nmy utilize one or all of the 
session identification reading, unit 620, information classification unit 630, and source unit 
640 when determining whether a document includes relevant information and whether to 
forward the document to a client and whether to cache the document. Fur cx nm pie. the 
meto data from the in formation classification unit 63(1 and the source information from the 
source unit 640 may be used to deny a client terminal access to the requested document 
regardless nf whether a session identification match exists. Alternatively, a weighting 
protocol may he utilized in situations when the mela data and source information generate* 
conflicting incJusionnry/ exclusionary results, 

22 



Printed from Mimosa 03/03/19 13:07:09 Page: 24 



WO Oft/36497 



PCT/US99/30I32 



The network communication* interface 610, session identification reading unit 
620. in formation classification unit 63 1), source unit 640, Enid evaluation unit 650 may he 
implemented using any known circuitry or technique. In an embodiment of the present 
invention where the filtering agent 600 is implemented in hardware, the network 
communications interlace $10, session identification reading unit 020, information 
classification unit 630, source unit 640, and evaluation unit 650 all reside on a single 
semiconductor substrate. In a preferred embodiment of the present invention, the Uttering 
agent 600 is implemented by software proxy server plug-ins or packet filtering extensions 

Figure? is a flowchart illustrating a method for managing ei document at a local 
computing rcsourec according to an embodiment of the prescm invention. At seep 701, a 
request for a document is received from a client terminal. The request fur the document 
may be a request to receive an HTML file from a remote server. The request may include 
an (JUL. that corresponds to where the HTML file is located. 

At step 702, it is determined whether the document requested is stored locally al 
the local computing resource. Determining wlicLhcr the document requested is stored 
locally may be achieved by checking a web cache or other storage device at the local 
computing resource, which may need to check with die original host to see if the cached 
copy is up-to-date. If the document requested is stored locally and is not expired or out- 
of-date, control proceeds to step 703, If the document requested is not stored locally or is 
expired or out-of-date, control proceeds to step 704, 

At step 70j, the document requested is forwarded from the storage device to the 
client terminal and a "Hit" notification is sent to the original host, usually via the master 
proxy server. The request for the document is not transmitted lu the source of the 
document. According to an embodiment nf the present invention, lhc request is logged in 
a local computing resource information profile. 

At step 704, Hie request for the document is transmitted or redirected to an external 
system. According to an embodiment of the present invention, the request fur the 
document is transmitted to a master proxy server on a nclwork operations center which has 
a connection to the source of the document or its own cached copy of Hie document or a 
mirror of the original host server. The network operations center may operate as a master 
proxy server or an I SI 1 fur the local computing resource. When transmitting the request 
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for a document to the network operations carter, the local computing resource niny also 
transmit session information that includes or points to information about the user 
requesting the document such a* a session identification- According to an alternate 
embodiment of tlie present invention where the local computing resource has a connection 
to the source of the document, the request for a document is transmitted to the source of 
the document witli instructions in return the requested document to the network operations 
center 130, such request containing the session identifier to be read by the session profile 
unit (5 JO) and session ID reading unit {620 J. 

At step 705, a document received from a unicast channel is forwarded to a client 
terminal requesting the document or a unicast or multicast announcement of a multicast 
transfer ot'the requested document ''Alerts" the session identification reading unit la look- 
out. Alternately or simultaneously, as part of step 704, the session JD and document 
request arc placed in a ■« Waiting for reply" queue that is read by the session identification 
reading unit and acts as die announcement to look-out for that same session ID on all 
monitored multicast efmnnclg. 

At step 706, it is determined whether a document received from a multicast 
channel contains relevant information anchor is a reply to a local user's request and has a 
session TD associated with a client terminal that location. According to an embodiment of 
lliw present invention, determining whether the document received includes relevant 
in form ati on includes determining whether the document may be used by a client terminal 
associated with a locat computing resource. If the document received does nor contain 
relevant information, control proceeds to step 707. If the document received contains 
relevant information, control proceeds to step 70S. Figure 8 is a block diagram illustrating 
the steps of bow determining whether a document received contains relevant information 
is achieved according to an embodiment of the present invention. 

At step 707. the document is discarded. 

At step 70S. the document is consumed or stored for latter consumption by the 
local computer resource. According to an embodiment of the present invention* the 
document received is determined to be requested by a client terminal and is forwarded co 
the client terminal. The document requested may be received from either the network 
operations center nr the source of the document directly depending on how the request was 
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made. 11 should be Appreciated that the document requested may be unicastcd or 
mullieastcd to the local computing resource and (hat die local computing resource may 
examine the requested document using known tecliniques to determine rbe identity of die 
client terminal to forward the document. The document received may also be cached 
locally oh a storage device. 

Figure 8 is a flowchart illustrating a method for filtering ft document at a local 
computing resource according to aii embodiment of the present invention. At step SOU a 
session identification is obtained from the document or ait associated announcement. 
According to an embodiment of the present invention- the session identification may be 
obtained by a session identification reading unit 620 (shnwn In Figure 6). The session 
identification reading unit 620 analyzes the document for a session identification 
associated with die document. The session identification reading unit 620 may be 
configured 1o recognize that the session identification may be transmitted with the 
document as a cookie appended in the header oHhe document, unique mela lays 5n the 
body, or using other techniques and search for the session identification accordingly. 

At step 802, it is determined whether the session identification associated with the 
document corresponds with on active session run by a client terminal on a local computing 
resource. According to an embodiment of the present invention, the determination is made 
by an evaluation circuit 650 (shown in Figure C). The evaluation circuit accesses a session 
log to determine whether the session identification associated with the document matches 
the session identifications that are active on the local computing resource. If die session 
identification associated with the document matches a session run by a client terminal 
control proceeds to step 803. If die session identification associated with the document 
doe?; not match a session run a the client terminal, control proceeds to step 804. 

At step 803, the document is forwarded to the client terminal. If the session 
identification associated with (he document corresponds tn a session being run by a client 
terminal on the local computing rcwiiirce % the document is recognized as a multicast 
response U> a unicast request for the document made by the client terminal. The. local 
proxy must spools a unicast reply U) the original request. 

At step K04, mcta data associated with a transferred document h obtained. 
According to an embodiment of the present invention, tbe tnera data may be obtained by 
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an information class iiication unit 630 (shown in Figure 61 The information classification 
unit 630 may obtain mcta data from the document from meta tags in the document or by 
rending the content of the document to determine relevant subject, matter, size, source, 
rating, keyword or other information relating to the document* 

At step 805. it is determined whether the meta data associated with the document is 
of interest to the local computing resource. According to an embodiment of the present 
invention determining whether the meta data is a match for local interests is performed by 
the evaluation unit 650. The evaluation unit 650 accesses a local information profile to 
determine whether the meta data associated with the document matches mcta data in the 
local information profile. The evaluation unit 650 may decide whether to cache or discard 
the document based on the determination. The local information profile may include 
inelusionary or exclusionary mcla data that may be configured by a system administrator, 
dynamically based on prior information consumption, or using oilier techniques or 
combination of techniques. If the meta data associated with the document is recognized as 
mchLsionary, control proceeds to .step 806. If the meta data associated with the document 
is not recognized, control proceeds to step 307. It .should be appreciated that even if the 
document i* of interest to the local computing resource, if the document includes a meta 
tag which indicates that the document should not be cached, control proceeds to step 809. 

At step 806, the document is pushed into cache. According to an embodiment nf 
ihe present invention when the proxy at llie Focal computing resource believes it has a 
unicast connection with the hosL server, the document is pushed into the cache. In an 
alternate embodiment of the present invention, the redirect unit 655 (shown in Figure 6) 
pretends to be a client terminal and requests the document from the proxy at the local 
computing resource. The redirect unit 655 intercepts Ihe proxy's forwarded request for the 
document and pretends tu be a host server having the document. The proxy caches the 
document as it forwards the document back to the redirect unit, which just discards it. 

At step 807, source information is obtained. According! to an embodiment, of the 
present invention, the source information may be obtained by the source unit 640 (shown 
in Figure 6). The snuree unit 640 analyzes the document for source information relating to 
where the document originated from. 
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At step 808, it is determined whether the source information associated with the 
document is of interest to the local computing resource. According to an embodiment of 
the present invention, determining whether the source information is of interest to the local 
computing resource is performed by the evaluation unit 650- The evaluation unit 650 
accesses a local information profile to determine whether the source information 
associated with the document matches source infnrnialion in line local information profile. 
The evaluation unit 650 may decide whether to cache or discard the document based on 
Ihc determination. The local informalion profile may include inclusionary or exclusionary 
source information fhnt may be configured by a syMcm administrator, dynamically based 
on prior information consumed, or usinie other techniques or combination of techniques, If 
the source information is recognized as hiclusionary. comrol proceeds to step 806. If the 
source information is not recognized or h recognized as exclusionary > control proceeds u> 
step 809. 

At seep 809, the document is discarded, 

It should be appreciated rlwt the steps in Figure 8 may he performed in an order 
other than that which is illustrated. l ; or example, the steps of determining whether the 
meta data is recognized and whether the source information is recognized may he 
performed at the same time or in opposite order. A process Lhut resolves any discrepancies 
in the results may he implemented. 

Figure 9 is a flowchart illustrating a method for managing a document at u 
network operations center according to an embodiment of the present invention. At step 
90], a request for a doa jroent is received from a local computing resource. The request 
for the document may be a request to receive an HTML file from a remote server. The 
request may include an URL that corresponds to where the HTML file fa located. 

At step 902, it is determined whether the document requested is stored locally at 
the network operations center. Determining whether the document requested \s stored 
locally may be achieved by cheeking a web cache or other storage device at the network 
operations center to determine whether the document requested was previously stored 
locally. If the document requested is stored locally r control proceeds to step 903. In one 
embodiment the original host is cheeked to assure the cache copy is fresh (Up-to-date) 
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before proceeding to 903. If the document requested is not stored locally, or is out-of- 
date, control proceed;? to step 904. 

At step 903, the document requested is forwarded from the storage device lo the 
client terryiinal. The document is not retrieved from the source. In one embodiment, a 
"I lit" may be sent to the hust to allow the host to account for the fact that the document 
was requested. According lo an embodiment of the present invention, the stored document 
may be checked to see if its time stamp is up to date by checking with the original host. If 
Hie time stamp is not up to date, control may proceed to step 904, hi another embodiment, 
step 903 might additionally check to sec if the document has been evaluated for 
multicasting, and if not jump to step 907 in order lo consider multicasting the cached 
document lo non-requeuing locnl computing resources. In this case, if 907 returns a no, 
then 903 proceeds as normal. 

At step 904, a detenu i nation is made as to whether the user requesting the 
document lias appropriaLC access privileges to effectuate m ulti casting the document to 
other local computing resources upon reeeipl of the document. The determination may be 
made by examining session information corresponding to die request. The session 
information includes information about the user and access privileges that belong lo die 
user. If there are insufficient access privileges to effectuate mnllk-nslmg the document 
upon receipt, control proceeds to .step 905. If there are sufficient access privileges to 
effectuate multicasting the document upon receipt, control proceeds to step 906. 
According to nit embodiment of the present invention, the examining session information 
may he achieved by a session profile unit 510 (shown in Figure 5). 

At step 905, the request for the document is forwarded to the source host for 
unicasl return lo the original requesting terminal. The session identification should still be 
appended hy the client. 

At step 906, tile request for the document i* transmitted to the source of the 
document with directions to direct ihe requested document back to the network operations 
center, such as a reply address. Session identification is still appended, 

A( step 907. it is determined whether the multicast channel winch the document is 
lo he transmitted on is in proper working order. If for example, the multicast channel is 
nut active or is experiencing Ifirge latency, control proceeds to step 905. If the multicast 
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channel is in proper working order control proceeds lo .step 908. According to an 
embodiment of the present invention, determining whether the multicast channel is in 
proper working oiricr may be achieved by the system status profile unit 540 (shown in 
Figure 5). 

Al step 90?, a determination is made as to whether the user requesting the 
document lias appropriate access privilege to effectuate multicas Ling the requested 
document to some or all local computing resource* on the network by sending on nne or 
more active multicast channels. The determi nation is made by examining session 
information corresponding to the request. If sufficient access privileges exist, control 
proceeds to step 913. If sufficient access privileges do not exist, control proceeds to step 
909. According lo tin embodiment of the present invention, examining session 
information corresponding, to the request may be achieved by the session profile unit 5 JO, 
In an alternate embodiment control may proceed to step 91 3 directly and select from a ii.st 
of active multicast clmnncly to transmit the document* 

At step 909, meta daLa relating to the requested document is obtained. Step 909 
can not proceed until the entire document is im;ived in response to step 906. The meta 
data may include information relating lo ihe subject matter, size, source, keyword, format, 
rating, or other data. According to an embodiment of the present invention, the meta data 
may be obtnined from meta tags in the requested document. According to an alternate 
embodiment of the present invention, meta data may be obtained by reading the content of 
the document. In one embodiment, obtaining meta data may be achieved by a document 
profile unit 520 (shown in Figure 5). 

At step 910* profile information relating to the Jocal computing resources on the 
network are obtained. The profile information may include information relating to the 
subject matter, size, source, key word, rating or other data that the local computing 
resources would like to receive. The profile information may also include the identity of 
multicast channels on which lo transmit the document lo the local computing resources. 
Accenting to one embodiment of the present invention, obtaining profile information may 
be achieved by a local compiiLing resource profile unit 530 (shown in Figure 5). 

At step 91 1 . a determination is made as to whether the requested document is a 
relevant document co multicast lo non-njq nesting local computing resources on one or 
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more active multicast channels. According to an embodiment of" the present invention, the 
determination is made by comparing the mcia data with active multicast channel profile 
information for matches and/or exclusions. It should he appreciated that other techniques 
ibr making the determination may also be utilized, if it is determined that the requested 
document docs not include relevant data to any group or class of local users nor mntch any 
active multicast cliamicl profile, control proceeds to step 912. If il is determined that the 
requested document includes relevant data, control proceeds to step 913. According to 
one embodiment of the present invention, determining whether the requested document 
jnel udes relevant data is achieved by a document transmission unit 550 (shown in Tiguic 
5). 

At siep 912, the requested document is jiul muHicasted to non-rcquc*tin£ local 
computing resources. Instead, the requested document is transmitted to the requesting 
local computing resource using unicast techniques. 

At step 913, the requested document is multicastcd on appropriate multicast 
channel (*) ui appropriate local computing resources. Step 913 can not proceed until the 
entire document is received in response to step 906. In an alternate embodiment, the 
document can be sent to all matching local computing resource by announcing to tliern the 
channel and time of multicasting of that document and perhaps the associated session 
identification to look for. The local computing resources may filter the document to 
determine for itself whether die document includes relevant, information. The requested 
document is also multic&sied lu the local computing resource requesting the document. 
Alternately, the document could be uiricaslud back to the original requesting local 
computing resource. The multicasting may he achieved by using batch processing where 
other documents stored locally on the web cache are tnultieasted with the requested 
document to the appropriate loon! computing resources. 

At snip 914. the requested dneumcm may be cached locally at the network- 
operations center. 

It should be appreciated that the network operations center managing the dueumcnL 
may be used only for performing the steps for supporting multicasting and/or uni easting as 
set out in steps 907-9 13 . In an embodiment of the present invention where a local 
computing resource has a connection to a source of the requested document the local 
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computing resource may redirect the source of Che requested document to transmit the 
document to the network operations cenleT where steps 907-914 are performed to provide 
a multicast response to a unicast reqxiesL 

In the foregoing description, the invention is described with reference lo specific 
exemplary embodiments thereof. It will, however, he evident iliat various modifications 
and changes may be made thereto without departing from the broader spirit and scope of 
the present invention ius set forth in the appended claims. The specification nnd drawings 
nrc accordingly U? be rejiarded in an illustrative rather than in a restrictive sense. 
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CLAIMS 
Wliai is claimed is; 

1 . A method for filtering a document comprising: 
reecivhig the document off of o multicast channel; 
determining if die document includes rcle\*ant information; and 
processing the document if the documents includes relevant information. 

2. The method of Claim I, wherein determining if" the document includes 
relevant information comprises: 

obtaining a session identification associated with a request for the document; 
determining whether the session identification corresponds to an active session run 
on a client terminal. 

3. The method of Claim 1 , wherein determining ifihc document includes 
relevant information comprises: 

obtaining meca data associated with the document; and 

determining whether the mcla data is of interest to a loeai computing resource, 

4. The method of Claim 3, wherein obtaining the meta data associated with 
the document comprises reading published meta Ugs in the document for the metadata. 

5. The method of Claim 3 # wherein obtaining the meta data associated with 
the document comprise,*! analyzing the content of the dueumenl. 
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6. The method of Claim 3, wherein determining whether the weta data i* of 
interest to Lhc local computing resource comprises referencing a local information profile 
to determine whether lhc mcta data is indexed in the local information profde. 

7. The method of Claim J, wherein determining af the doeumcnl includes 
relevant information comprises: 

obtaining source information associated wjth the document; and 
determining whether the source information is of interest to a local computing 
resource. 

8. The method uf Claim 7. wherein dclerminrng whether the <fuuree 
information is of interest to (he local computing resource comprises referencing a local 
information profile to determine whether the source information is indexed in the local 
information profile. 

9. The method of Claim I. wherein processing the document comprises 
forwarding the document to a client terminal that requested the document. 

1 0. The method of Claim 1 , wherein processing the document comprises 
caching the document in a storage device, 

11. A computer-readable medium having stored thereon a sequence of 
instructions, the sequence of instructions including instructions which, when executed by a 
process [ir, causes ths processor to perform the steps of: 

receiving a document off of a multicast channel; 
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determining if the document includes relevant information; and 
proccswins (be document if the document includes relevant information. 

1 2. The computer-readable medium of Claim 1 i, wherein determining if thu 
documcnl includes relevant information comprises: 

obtaining a session identification associated with a request lor the document; 
determining whether the session identification corresponds to an active session run 
on a client terminal 

1 3. The computer-readable medium of Claim I J , wherein determining if the 
doc amen l includes relevant information comprises: 

obtaining mcta data associated wiih the document; and 

determining whether the meta data is of interest to a locat computing resource. 

14. The computer-readable medium of Claim 13. wherein obtaining ihe meta 
dala associated with the document comprises reading published meta tags in the document 

1 5. The computer-readable medium of Claim 1 3, wherein obtaining the meta 
data associated with the document comprises analyzing llie content of the document. 

16. The computer-rcadahle medium of Claim 13, wherein determining whether 
the meta data is of interest to the local computing resource comprises referencing a local 
information profile to determine whether the mcta da I a is indexed in the local information 
profile. 
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1 7. Vhc computer-readable medium of Claim 1 1 . wherein determining if the 

document includes relevant information comprises: 

obtaining source information associated with ihc document: find 
determining whether the source information is of interest to n Jocal computing 

resource. 

IS. I hc computer-readable medium of Ciaim 17, wherein determining whether 
the sourec information is of interest to ihc local computing resource comprises referencing 
a local information profile to determine whether the source in formation iy indexed in the 
loertl information profile. 

1 9. The computer-readable medium of Claim 1 1 , wherein processing the 
document comprises forwarding the document to a client terminal that requested the 
document. 

20. The cnmpulcr^rcadable medium of Claim 1 1 , wherein processing the 
document comprises caching the document in a storage device, 

21. A filter i tin agent, conipri sing; 

a session identification reading unit that retrieves a session identification from a 
document transferred over a multicast channel; 

an information classification unit that retrieves mera data from the document; 

a source unit that retrieves source information from the document; and 

an evaluation unit, coupled to the session identification reading unil, information 
classification unit, and the source unit, that determines whether the document includes 
relevant information based on (he .session identification, mcla data, and the source 
information. 
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